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Empirical processes indexed by 
estimated functions 
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Vrije Universiteit Amsterdam and University of Washington 

Abstract: We consider the convergence of empirical processes indexed by 
functions that depend on an estimated parameter rj and give several alterna- 
tive conditions under which the "estimated parameter" can be replaced 
by its natural limit rjo uniformly in some other indexing set 0. In particular 
we reconsider some examples treated by Ghoudi and Remillard [Asymptotic 
Methods in Probability and Statistics (1998) 171-197, Fields Inst. Commun. 44 
(2004) 381-406] . We recast their examples in terms of empirical process theory, 
and provide an alternative general view which should be of wide applicability. 



1. Introduction 

Let Xi, . . . , Xn be i.i.d. random elements in a measurable space {X,A) with law 
P, and for a measm'able function / : A" — > R let the expectation, empirical measure 
and empirical process at / be denoted by 

r 1 " 

Pf = / fdP, Pnf - - 5] /(X,), Gnf = V^(P„ - P)f. 

" 1=1 

Given a collection {fg^jj : 6 G &,!] E H} of measurable functions fg,,j : X ^ M. 
indexed by sets O and H and "estimators" rjn, we wish to prove that, as rt — > oo, 

(1) sup 

Bee 

Here an "estimator" r]n is a random element with values in H defined on the same 
probability space as Xi, . . . ,X„, and 770 G -ff is a fixed element, which is typically 
a limit in probability of the sequence ?7„. 

The result ([T]) is interesting for several applications. A direct application is to 
the estimation of the functional 9 1— > P fg^n- If the parameter rj is unknown, we may 
replace it by an estimator and use the empirical estimator Pnfg,ri„- The result 
(HI) helps to derive the limit behaviour of this estimator, as we can decompose 

(2) V"(IPri/e,,7„ - Pfe.no) = Gn{fg,r,„ - fg.no) + '^nfg.no + VnP{fg,r,„ - fe.no)- 

If HI) holds, then the first term on the right converges to zero in probability. Un- 
der appropriate conditions on the functions fg^rjoj the second term on the right 
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will converge to a Gaussian process by the (functional) central limit theorem. The 
behavior of the third term depends on the estimators r]n, and would typically 
follow from an application of the (functional) delta-method, applied to the map 

In an interesting particular case of this situation, the functions fe^rj take the form 

for maps 6* : — > R and each rj G H being a map rj : X ^ M.'^. The realizations 
of the estimators r]n are then functions x i-^ rin(x) = rjn{x; Xi, . . . , Xn) on the 
sample space X and can be evaluated at the observations to obtain the random 
vectors rin{Xi), . . . ,rjn{Xn) in R''. The process {^nfe,riru : €E 8} is the empirical 
measure of these vectors indexed by the functions 9. For instance, if Q consists 
of the indicator functions l(_oo,e] for ^ ^ H^'^i then this measure is the empirical 
distribution function 

n 
i=l 

of the random vectors ?7„(Xi), . . . , 77n(X„). The properties of such empirical proces- 
ses were studied in some generality and for examples of particular interest in Ghoudi 
and Remillard [i,0]- Ghoudi and Remillard ^ apparently coined the name "pseudo- 
observations" for the vectors 77„ (Xi ),..., 77„(X„). The examples include, for in- 
stance, regression residuals, Kendall's dependence process, and copula processes; 
see the end of Section 2 for explicit formulation of these three particular examples. 
One purpose of the present paper is to extend the results in these papers also to 
other index classes 8 besides the class of indicator functions. Another purpose is to 
recast their results in terms of empirical process theory, which leads to simplification 
and alternative conditions. 

A different, indirect application of ([T]) is to the derivation of the asymptotic 
distribution of Z-estimators. A Z-estimator for 9 might be defined as the solution 
9n of the equation Pnf9,ri„ = 0, where again an unknown "nuisance" parameter rj 
is replaced by an estimator rjn- In this case ^ shows that 

Pn/^„,„„ - P„/,^„,,„ = P(/^„,,„ - fe^^J + 0P(1/V^), 

so that the limit behavior of 9n can be derived by comparison with the estimating 
equation defined by Pn/e,r;o (with rjo substituted for ri„). The "drift" sequence 
^(■^9„,r,„ ~ /e„,»?o)' '^t'ich will typically be equivalent to P{feo.v^ ~ feo.vo) up to 
order op{l/y/n), may give rise to an additional component in the limit distribution. 

The paper is organized as follows. In Section[2]we derive general conditions for the 
validity of ([1]) and formulate several particular examples to be considered in more 
detail in the sequel. In Section [3] we specialize the general results to composition 
maps. In Section [4] we combine these results with results on Hadamard differen- 
tiability to obtain the asymptotic distribution of empirical processes indexed by 
pseudo observations. Finally in Section [5] we formulate our results for several of the 
particular examples mentioned above and at the end of Section [51 

2. General result 

In many situations we wish to establish ([1]) without knowing much about the nature 
of the estimators rjn, beyond possibly that they are consistent for some value 770. 
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For instance, this is true if ([T]) is used as a step in the derivation of M— or Z 



estimators. (Cf. Van der Vaart and WeUner [I3 and Van der Vaart Then an 

appropriate method of estabhshing ^ is through a Donskcr or entropy condition, 
as in the foUowing theorems. Proofs of the Theorems 12.11 and 12.21 can be found in 
the mentioned references. 

Both theorems assume that ry„ is "consistent for 779" in the sense that 

(3) supP(/e.„„-/e,^o)'^pO. 

eee 

Theorem 2.1. Suppose that Hq is a fixed subset of H such that Pr(77„ G Hq) — > 1 
and suppose that the class of functions {fe,rj ■ 9 G &,ri G Hq} is P-Donsker. If ([3]) 
holds, then ([T]) is valid. 

For the second theorem, let A^(e, L2{P)) and (e, L2{P)) be the e-covering 
and e-bracketing numbers of a class T of measurable functions (cf. Pollard 's'l and 
van der Vaart and Wellner [l3|) and define entropy integrals by 

(4) J{5,T,L2)^ [ svcp J log N{e\\F\\Q a, T,L2{Q))de, 

Jo Q ^ 

(5) J[]((5,.F,L2(P)) = ^'y^logiV[](e|jF||p,2,.F,L2(P))&. 

Here F is an arbitrary, measurable envelope function for the class J^: a measurable 
function F : X ^ R such that < F{x) for every f G T and x e A". We 

say that a sequence Fn of envelope functions satisfies the Lindeberg condition if 
PFl = 0(1) and PFlXp^^^^^ for every e > 0. 

Theorem 2.2. Suppose that iJ„ are subsets of H such that Pr(?7„ €E Hn) — > 1 
and such that the classes of functions Tn ~ {fe.rj : ^ G 6,f? G Hn} satisfy either 
J[.]{Sn,J^n,L2{P)) 0, or J{Sn,J-'n, L2) ^ for cvcry sequence 5„ 0, relative 
to envelope functions that satisfy the Lindeberg condition. In the second case also 
assume that the classes Tn are suitably measurable (e.g. countable). If ^ holds, 
then ^ is valid. 

Because there are many techniques to verify that a given class of functions is 
Donsker, or to compute bounds on its entropy integrals, the preceding lemmas give 
quick results, if they apply. Furthermore, they appear to be close to best possible 
unless more information about the estimators ri„ can be brought in, or explicit 
computations are possible for the functions fe^n- 

In some applications the estimators rjn are known to converge at a certain rate 
and/or known to possess certain regularity properties (e.g. uniform bounded deriva- 
tives). Such knowledge cannot be exploited in Theorem 12. 11 but could be used for 
the choice of the sets Hn in Theorem 12.21 We now discuss an alternative approach 
which can be used if the estimators ry„ are also known to converge in distribution, 
if properly rescaled. 

Let _ff be a Banach space, and suppose that the sequence \/n{r]n — ?7o) converges 
in distribution to a tight, Borel-measurable random element in H . The "convergence 
in distribution" may be understood in the sense of Hoffmann-j0rgensen, so that ry„ 
need not be Borel-measurable itself. 

The tight limit of the sequence \/n(rin—rio) takes its values in a cr-compact subset 
Hq C H. For 9 E Q, ho G Hq, and S > define a sequence of classes of functions by 

(6) Tn{9,hQ,5) = {/e,,,„+„-i/2,j - fe,,^o+n-^/^ho ■ h e H, \\h- ho\\ < S}. 
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Let Fn{0, ho, S) be arbitrary measurable envelope functions for these classes. 

Theorem 2.3. Suppose that the sequence ^/n{rin — rjo) converges in distribution to 
a tight, random element with values in a given a-compact subset Hq of H . Suppose 
that 

(i) supg |G„(/e_^^+„-i/2;,,, - fe,no)\ for every ho E Hq. 
(a) supg |G„F„(6', ho, S)\ for every S > and every ho G Hq; 
(Hi) supgsnpf^^^j^ y/nPFn{0,ho,Sn) for every 5„ ^ and every compact 
K c Ho; 

Then ([T]) is valid. 

Proof. Suppose that y/n{r]n — Vo) Z and let e > be fixed. There exists a 
compact set K C Hq with P{Z G K) > 1 — e and hence for every S > 0, with 
the set of all points at distance less than S to K, 

liminf Pr(V^(77„ - ?;o) G K^^^) > 1 - e. 

n — ^oo 

In view of the compactness of K there exist finitely many elements hi, ... , hp G 
K C Hq (with p — p{6) depending on S) such that the balls of radius 6/2 around 
these points cover K. Then K^^^ is contained in the union of the balls of radius S, 
by the triangle inequality. Thus, with B(h, 6) denoting the ball of radius S around 
h in the space H, 

piS) 

{V^iVn - rio) e K^/^} C U {vn G Bim + n-^/^h„ 5)). 

i=l 

It follows that with probability at least 1 — e, as n — s- cx), 
sup|(G„(/e,,,„ - /e,7,o)l 

9 

<supmax sup |G„(/e +„-i/2;j - /e,,,o)| 
e » \\h-h,\\<s 

< sup max sup 

9 « ||?i-/ii||<<5 

+ \'^n{f9^rto+n-y^h, " /e,J?o)l 

< supmax |G„F„(6', /li, (5)1 + 2sup sup y/nPFn{9,ho,S) 

9 « e hoeK 

+ supmax|G„(/e +„-i/2^^ ~ fe,7,o)\, 

9 » 

where in the last step we use the inequality |G„/| < |G„F| + 2y/nPF, valid for any 
functions / and F with |/| < F. The maxima in the display are over the finite set 
i — 1, . . . ,p{5), and the elements hi, ... , hpi^g^ G K depend on S. By assumptions 
(i) and (ii) the first and third terms converge to zero as n — s- oo, for every fixed 
5. It follows that there exists 5„ | such that these terms with Sn substituted 
for d converge to 0. For this 6n, all three terms converge to zero in probability as 
n — !■ oo. □ 

The rate of convergence ^/n in the preceding theorem may be replaced by another 
rate, with appropriate changes in the conditions, but the rate ^/n appears natural in 
the following context. For more general metrizable topological vector spaces there 
are similar, but less attractive, results possible. 



\'^n{f9,r]o+n-^/^h " /s.rjo+n- i/^/ii ) I 
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The two conditions (i), (ii) of Thcorem l2 . 31 concern the empirical process indexed 
by the classes of functions 

(7) {fe,rio+n-U2ho - fe.va ■ ^ E Q} , 

(8) {Fnie,ho,S) : 9ee}. 

These classes are indexed by Q only, and hence Theorem 12.31 if applicable, avoids 
conditions for ([1]) that involve measures of the complexity of the class {/e,,, : G 
6, ?7 e H} due to the parameter rj € H. 

Condition (iii) of Theorem 12.31 involves the mean of the envelopes of the classes 
^n{d, ho, 5). For the minimal envelopes this condition takes the form 

(9) sup sup V^F sup \fe.r,o+n-UVi- fe.r,o+n-^/2ho\ ^ 

e hoeK ||h-/io||<<5„ 

for all (5„ I 0. This is an "integrated uniform local Lipschitz assumption" on the 
dependence rj i— s- /e,?;. In some applications it may be useful not to use the minimal 
envelope functions. The lemma is valid for any envelope functions, as long as the 
same envelopes are used in both (ii) and (iii). 

The set K in (iii) or ([H]) is a compact set in the support of the limit distribution 
of the sequence ^/n{r]n — ?7o)- In some cases condition (iii) may be valid for any 
compact K C H, whereas in other cases more precise information about the limit 
process must be exploited. For instance, if the sequence \pn(r\n — 770) converges in 
distribution to a tight zero-mean Gaussian process G in the space H — £°°{T) of 
bounded functions on some set T, then K may be taken to be a set of functions 
z : T that is uniformly bounded and uniformly equicontinuous relative to the 
semimetric with square cf{s,t) = E{Gs — GtY (and T will be totally bounded for 
d). Cf. e.g. van der Vaart and Wellner [13], page 39. 

Condition (iii) is an analytical condition, whereas conditions (i) and (ii) are 
empirical process conditions. In many cases the latter pair of conditions can be 
verified by standard empirical process type arguments. For reference we quote two 
lemmas that allow handling the empirical process indexed by a sequence of classes, 
as in ([8]) or (III). (For proofs see e.g. van der Vaart [10, JjJ.) Both lemmas apply to 
classes Tn of measurable functions f : X such that 

(10) sup Pf 0. 

Lemma 2.1. Suppose that the class of functions lj„-?>i is P-Donsker. If (|10p 
holds, then supjgjp^ |G„(/)| — +p 0. 

Lemma 2.2. Suppose that either J^.-^(6„, J-n, L2{P)) ^ or J{Sn,J-n, L2) for 
all S„ I relative to envelope functions Fn that satisfy the Lindeberg condition. In 
the second case also assume that each class Tn is suitably measurable. If (|10p holds, 
then supjgjF^^ l'G„(/)| 0. 

Example 1 (Regression residual processes). Suppose that (Xi, Yi ),..., (X„, 
Yn) are a random sample distributed according to the regression model Y = g^{X) + 
e. For given estimators r]„ we can form the residuals Ci = Yi — gr^^{Xi) and may be 
interested in the empirical process corresponding to ei, . . . , e„, i.e. for a collection 
of functions : M ^ R we consider the process {n~^ ^"=1 ^(^i) • S G ©}• This 
fits the general set-up with the functions fo^ri defined as fg^jjix, y) — Oi^y — gri{xj). 



Random empirical processes 



239 



In many cases it will be possible to apply Theorem l2.1l For instance, if a; G M"*, 
is a polynomial in x, and O is the class of indicator functions l(_oo,6i] for ^ G K, then 
the functions arc the indicator functions of the sets {{x,y) : y — g^^ix) —0< 0}. 
Because the set of functions {x, y) i— > y—gri{x)—9 is contained in a finite-dimensional 
vector space, it is a VC-class, and hence so are their negativity sets (e.g. van der 
Vaart and Wellner Lemma 2.6.18). Thus the class of functions fe.ri is Donsker, 
and Theorem 2.1 can be applied directly. 

Example 2 (Kendall's process). Let rjn be the empirical distribution function of 
a random sample Xi, . . . , Xn from a distribution rjo on M"^. Barbe, Gcncst, Ghoudi 
and Remillard 0] and Ghoudi and Remillard Q study the behavior of the empirical 
distribution function Kn of the pseudo-observations r]n{Xi), 

1 " 

^n(e) = -Vl{?7„(X,)<n, ee[0,i], 

n ^ — ' 

and the resulting Kendall's process 

(11) ^/Ti{K^{B)-K{B)), 0e[O,l] 

where K{B') = P{r]o{X) < 6). This fits the general set-up with /g^^ the composition 
function /g^^ = 9orj, and 9 the indicator function l(_oo,e] (where we abuse notation 
by using the symbol in two different ways). 

An attempt to apply Theorcm l2.1l to this problem would lead to the consideration 
of the class of all indicator functions of sets of the form {x € R"* : 77(2;) < 6*} for 
r] ranging over the cumulative distribution functions on and 9 E [0,1]. This 
class is similar to the collection of all "lower layers" in M"^, and, unfortunately, 
fails to be Donsker for most distributions (cf. Dudley page 264, 373 or Dudley 
Q). In this case it appears to be necessary to exploit the limit behaviour of the 
sequence \Ai('7n ~ % ) • Ghoudi and Remillard have shown that ([T]) is valid in this 
case, under some strong smoothness assumptions on the underlying measure ryo. In 
Sections |4] and [5] we rederive some of their results by empirical process methods 
using Theorem 12.31 

We also consider the empirical process of the variables ?7„(Xi), . . . ,?7„(X„) in- 
dexed by classes of functions other than the indicators l(_oo.6i]- H the indexing func- 
tions are smooth, then this empirical process will converge even without smoothness 
conditions on rjo. A proof can be based on Theorem l2.3l 

Example 3 (Copula processes). Suppose that Xi, . . . , Xn are a sample from a dis- 
tribution 7^0 on M''. Write Xi — (Xi,i, . . . , Xi^d) and let 770,1, • ■ • , VoM be the marginal 
distributions. The copula function C associated with 770 is the distribution function 
of the vector (770,1(^1,1), . . . ^VoA^i.d)) , i-e. with ?7(7](u) = inf{a: : m.jix) > u} for 
we [0,1], 

C(ui, ...,Ud)= miVo.liui), ■ ■ ■,Va,d{ud)) 

for {ui, . . . ,Ud) G [0,1]''. For j = l,...,d let rjnj be the empirical distribution 
function of Xij, . . . , Xn,j (on M), and let ?]„ be the empirical distribution function 
of Xi, . . . , Xn (on M''). Then a natural estimator C„ of C is given by 

1 " 

C«(w) = - Vl{e„,» < u}, ue[0,l]^ 

n ^ — ^ 
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for the "pseudo-observations" e„_,; — {r].n,i{Xi^i), . . . ,r]nM{Xi^d)) ■ The resulting 
"copula processes" , 

(12) V^{Cn{u)-C{u)), ue[0,lf, 

have been considered by Stute Ganssler and Stute [H], and Ghoudi and Remil- 
lard This example can be treated using Theorem 12.31 but also with the more 
straightforward Theorem 1 2. 11 or even by employing the theory of Hadamard differ- 
entiability, as in Chapter 3.9 of Van der Vaart and Wellner [12| . 

3. Composition 

In this section we consider the case where the functions /g.,, take the form 

(13) fe.r,{x) = 0{v{x)), 

for 6 ranging over a class 8 of functions : M'' M and 77 ranging over a class H of 
measurable functions : A" — > R''. We first give general conditions for the validity 
of condition (iii) of Theorem 12.31 and next consider also the conditions (i) and (ii) 
for the special cases of functions 6 that are Lipschitz and monotone, respectively. 
We develop these results for the case that the sequence -y/n('7n ~ ^70) converges in 
distribution in the space H = £°°{X,'&.'^) of uniformly bounded fimctions z : X ^ 
Mf^, equipped with the uniform norm \\z\\ = sup^^;^\\z{x)\\. (Variations of these 
results are possible. For instance, M'' could be replaced by a more general Banach 
space, and H could be equipped with a weighted uniform norm.) 

3.1. Condition (i) 

For fs,jf taking the form ([13]), condition (i) of Theorem 12.31 takes the form 

(14) sup |G„,q/| ->q 

for Q = P o ijiQ, hQ)~^, G„^Q the empirical process of a random sample from the 
measure Q, and J-n the class of functions 

(15) - {{y, z) ^ e{y + 7i-^'^z) - e(v) : e e}. 

Condition (i) requires that (1141) is valid for every fixed choice of /lo G Ho^ i-c for 
every measure Q determined as the law of {r}{){X), ho{X)) for some ho E Ho and X 
distributed according to P. 

This situation is of the form considered in Lemmas l2.1l and l2.2| and both lemmas 
may be applicable in a given setting. It is not especially helpful to restate these 
lemmas for the present special situation. Instead, we give one easy to check set of 
sufficient conditions. This covers VC-classes Q, and much more. 

If Qenv : K'' ^ M is an envelope function for 0, then 

(16) F„{y, Z) = Qenviy + n-^/^z) + Oenviy) 

is an envelope function for J-n- (A crude one, because we do not exploit that the 
functions in J-'n are differences.) 
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Lemma 3.1. Suppose that J{1,0, L2) < 00, that Q is suitably measurable, that 
-P(0eni>°'7o)^ < 00, and that the functions Qenv°{vo+n^^^^ho) satisfy the Lindeberg 
condition in L2{P), for every Hq G Hq. If 

sup P{e o {rjo + n-^/^ho) - e o -qoY ^ 0, 
eee 

for every Hq € Hq, then condition (i) of Theorem 12.31 is satisfied for the functions 
fe^n given by (US]). 

Proof. It suffices to prove (fT4|) for the classes given in pS]) . The class JF„ is 
contained in the difference of the classes 

These classes possess envelope functions F^^ and F" defined by 

F^{y,z) = eenv{y + n-^/^z), 
F"{y,z) = e,nv{y). 

The uniform entropy of relative to F" is finite by assumption. The uniform 
entropy of J^i relative to is exactly the same, as the law of F + n'^^^Z runs 
through all possible laws on R'* if the law of {Y, Z) runs through all possible laws 
on R"^ X M''. The uniform entropy of JF„ relative to Fn is bounded by the sum of 
the uniform entropies of JF,'j and T" . (Cf. e.g. Theorem 2.10.20 of van der Vaart 
and Wellner [12]-) Now apply Lemma [221 □ 



3. 2. Lipschitz functions 6 

Assume that every function 6* : M'' ^ R in the class Q is uniformly Lipschitz in 
that 

(17) \d{ri)-0{r2)\<\\ri-r2\\. 
Then, for every x G X, 

e{Tioix) + n-i/2/z(x)) - e{r)oix) + n-'/^ho{x)) 

The norm in the right side is bounded by the supremum norm — /io|| on £°°{X, 
R''). It follows that the classes !Fn{d, Hq, 5) as in ^ possess envelope functions 

(18) Fr,{e,ho,S)^5/V^. 

Theorem 3.1. If O is a suitably measurable collection of uniformly bounded, uni- 
formly Lipschitz functions 6 : — > R such that J{1,Q, L2) < 00 (relative to a 
constant envelope function), rjo £ {X jR"^) , and the sequence y^{r]n — ?/o) con- 
verges weakly in £°°{X,M.'^) to a tight random element, then 

sup|G„(0(?7„) - 6'(77o))| 0. 



< 



\\n[x) - no[x)\\ 
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Proof. With the envelope functions F„(0, ho, 5) as defined in p^ . condition (ii) of 
Theorem 12.31 is triviaUy satisfied because the envelopes are actually constants, and 
the validity of condition (iii) is immediate. 

By assumption we can choose the envelope function of Q equal to a constant 
and J{1,Q, L2) < 00. This suffices for the verification of most of the conditions of 
Lemma [3. f I Finally, it suffices to note that 

P{do{r^o+n-'^^ho)-9oT^„Y < \\hof/n. 



By Lemma 13.11 we conclude that condition (i) of Theorem 12.31 is also satisfied, 
whence the theorem follows from Theorem 12.31 □ 



For the verification of condition (i) of Theorem 12.31 it suffices to consider the 
functions 6 on the range of the functions 770 + ho/ ^/n for a fixed ho in the support 
of the limit distribution of the sequence y/n{rj„ — r?o)- Thus we may restrict the 
functions 6* to a subset of M'' that contains the ranges of these functions and in- 
terpret the condition J{1,Q, L2) < 00 in Lemma |3 . 1 1 accordinglv. In particular, in 
Theorem l2.3l we may replace this condition by the condition that J(l, Qk, L2) < 00 
for every norm-bounded subset K C M."^, where Qk is the collection of restrictions 
6* : i^: ^ R of the functions € O. 

Any collection of uniformly bounded, Lipschitz functions 9 : K R on a com- 
pact interval K satisfies J(l, 8, L2) < 00. (Cf. e.g. van der Vaart and Wellner [l^ . 
page 157.) Thus in the case that d = 1 the assertion of the preceding theorem is 
true for any collection of uniformly bounded Lipschitz functions. 

For d > I further restrictions on the class O may be necessary. For instance, any 
subset of the unit ball in the Holder space C" {K) for a compact interval K C M.'^ 
possesses a finite uniform entropy integral provided a > d/2. (Cf. e.g. van der Vaart 



and Wellner [1^, page 157.) The assertion of the preceding theorem is also true for 
such a class. 

There are many other examples of classes of Lipschitz functions with finite uni- 
form entropy integrals. For instance, VC-classes of Lipschitz functions. 

3.3. Monotone functions 9 

Assume that every function 6* : M'' ^ R in is the survival function 0{x) = 
J l[3;,oo) d9 of a subprobability measure on M'*. Then each 9 is nonincreasing in each 
of its arguments, li H = i°°{X, M'') equipped with the uniform norm relative to the 
max- norm on W^, then 

9{7io + n-^^^h)-9{fio+n-^/^ho) 

< 9{7]o + n^^'^ho - n-^'^\\h~ ho\\) - 0(770 + n-^/^ho + n~^'^\\h~ ho\\). 

It follows that the classes Tn{9, ho, S) possess envelope functions, with 6 the vector 
{S,...,6), 

Fn{9, ho, 5) = 9{r]o + n~^/^ho - - 9{'qo + n'^/^ho + n-^''^5). 

In order to verify condition (iii) of Theorem 12. 31 we assume that for given (possibly 
infinite) a < 5 in W'' and every 5n i and compact set K C Hq U {0}, 

(19) sup sup VnP {\„+n~l/2ha^t+n-U2S^ - l^o+„-i/2,,o<t) ^ 0. 

tm.'^.a<t<bhoeK 
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Theorem 3.2. Let be a collection of survival functions 9 : — > [0, 1] of subprob- 
ability measures supported on an interval (a, b) C M"*. // the sequence y/n{rjn — ?7o) 
converges in distribution in £°"[X ^W^) to a tight Borel measure concentrating on the 
a-compact set Hq, and (|19p holds for every dn I and every compact K C i?oU{0}, 
then 

sup|G„(6l(77„) - 6'(77o))| ->p 0. 

e 

Proof. The survival functions of subprobabiUty measures are in the convex hull of 
the set of indicator functions l[t,oo): which is a VC-class. Therefore the entropy 
integral J(1,0,L2) of Q relative to a constant envelope is finite. (Cf. e.g. van der 
Vaart and Wellner [13], page 145.) 

Defining Fn{0, rjQ, 5) as in the display preceding the theorem, we can write 

PK(0,?7o,'5) 

= j P{^^^^,^{m + 7i-^l^ho-n-^'H)-\_^^,^{7^o + n-^/^ho))de{s) 
<\\e\\ sup P{li-^,s]{m + n-^'^ho-n-^'^5)^l^_^^,^{T^o + n~^'^ho)). 

a<.s<b 

By assumption pj)) the right side converges to zero faster than l/^/n, for every 
S = 6n I 0, uniformly in ho S K, and uniformly in 9 because the total variation 
norms \\9\\ are uniformly bounded. This verifies condition (iii) of Theorem l2.3l 
Because 9 is monotone with range contained in [0, 1], 

F(%o+""'/''5)-%o))' < sup P{li^o.,s]{vo-n-'^^S)-^_^^s]ivo)). 

a<s<b 

By assumption with ho = this converges to zero faster than l/^/n for every 
sequence J, 0. This can be seen to imply that the expression in the display (which 
does not have the leading converges to zero also for fixed 5. By monotonicity 
of 9 we can bound \9{r]Q + ho/y/n) — 9{i]o)\ by \9{r]o — S/y/n) — 9{rio)\ for S — \\ho\\. 
By Lemma [3. II we now conclude that condition (i) of Theorem 12.31 is satisfied. 

In the present case the envelope functions Fn{9, ho, S) are equal to the functions 
/e,?7o+n-i/2/i — /e,?7o-n-i/2/i for h = ho — S. Therefore, the validity of condition (ii) 
of Theorem 12.31 follows by the same arguments as used for the validity of condition 
(i). □ 

Condition (|19p is a uniform Lipschitz condition on the distribution functions of 
the variables rio{X) + ho{X) / -/n. If the distribution of rio{X) is smooth, then we 
might expect that the distribution functions of the perturbed variables rio{X) + 
ho{X)/^Jn will be smooth as well. However, this appears not to be true in general, 
and it will usually be necessary to exploit some information about the functions ho. 
(We need to consider functions in the support of the limit measure of the sequence 
\/n{r]n — fjo)-) In this respect the conditions of Theorem 13.21 for composition with 
monotone functions are much more stringent than the conditions of Theorem 13.11 
for the composition with Lipschitz functions. 

The condition (|19[) is in terms of the indicator functions l(_oo,s], ^nd would have 
exactly the same form if we considered only indicator functions 9 = l(_oo,6i], rather 
than general monotone functions. Thus the restrictive condition is connected to 
studying the classical empirical process. 

The following lemma allows the verification of condition (fTO|) in many cases. It 
will also be used in the next section to prove applicability of the delta-method. The 
lemma is similar to Lemma 5.1 of Ghoudi and Remillard ]&. 
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Lemma 3.2. Suppose that X,Y,Yt (with t > 0) are real-valued random variables 
on a common probability space such that 

(i) X possesses a Lebesgue density f that is continuous in a neighbourhood of x; 

(ii) \\Yt - rlico ^ and ||y||oo < 

(Hi) the conditional distribution ofY given X = s can be represented by a Markov 
kernel K{s, ■) such that the map s ^ K{s, •) is continuous at x for the weak 
topology. 

Then for every continuous function g : M — > M and every converging sequences 
Xt ^ X, at ^ a and < bt ^ b, as t ^ 0, 

j'Eg{Yt)l^^<x+tatYt<xt+tbt j g{y)K{x,dy) fix). 

Proof. First consider the case that 1^ = y for every t. By the definitions of K and 
/, we can write 

■^Efif (5^) Ixt <X+tat r <xt +tbt 

^\ j ^{9{Y)'^{xt-s)/t<atY<{xt-s)/t+bt\X = s) f{s)ds 

= j j 9{y)'i-u<aty<u+bt K{xt - ut, dy) f{xt - ut) du. 

The inner integral is equal to 'Eig{Yi)lu<:atYt<u+bi for Yj possessing the law K{xt — 
ut, ■). By assumption atYt converges in distribution to the law of al^ for Y possessing 
the law K{x, ■). It follows that the inner integral converges to J g{y)lu<ay<u+b K{x, 
dy) for any {u, b) such that u and u + b are not among the atoms of the law of aY. 
This includes almost every u for every fixed b. Because Y has bounded range, the 
double integral can he rc^stricted y in a compact set and hence u in a compact set; 
the argument Xt — ut fo / is then restricted to a neighbourhood of x. Therefore, 
we can apply the dominated convergence theorem to see that the right side of the 
display converges to 

g{y)'^u<ay<u+bK{x,dy) f{x)du. 

This reduces to 6 / g{y) K{x, dy) f{x) by Fubini's theorem. This concludes the proof 
of the lemma for the case that Yj = Y . 

Because g is continuous, Y possesses bounded range and \(Yt — Y\^ — > we 
have that \g(Yt) - 5(^)11 <x> ^ 0. Therefore 

^\g^t) - g{Y)\\^^^x+tatYt<xt+tbt 



t 



< o(l)i Vr{xt - tatWYtW^ <X<xt + t + tat||i"t||oo). 



This converges to zero as X has a density that is bounded on bounded intervals. 

Finally the difference lxt<x+tatYt<xt+tbt - ixt<x+tatY<xt+tbt is nonzero only if 
X + tatY is in the union of intervals of total length bounded by tatHYt — y||oo in a 
neighbourhood of x. By the lemma with Yt = Y and g = 1, which is already proved, 
Pj:{xt < X + tUfY < Xt + tct) — > cf{x) for any sequences Xt ^ x and Ct — > c. 
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Hence this probability converges to zero for ct = tat\\Yt — Y\\oc, which satisfied 
Ct — *■ 0. We conclude from this that 

j\^9{y){'^Xt<X+tatYt<Xt+tbt — ixt<X+tatY<Xt+tbt)\ 

^ \\9{Y)\\oc-E\lxt<X+tatYt<Xt+tbt " ^Xt<X+tatY <Xt+tbt\ 

converges to zero. The proof of the lemma is complete upon combining the preced- 
ing. □ 

In order to verify (|19p with K — {ho} a, single function we can apply the preceding 
lemma with {X, Y) equal to the pair of variables (^rio{X), ho{X)), bt — (5„, and t — 
1/y/n. Then the conditions of the lemma require that the variable 'rio{X) possesses a 
continuous density, and that the conditional distribution of hQ{X) given i]o{X) — z 
depends continuously on the value of z. The second condition is clearly unpleasant, 
but appears to be natural in the present situation. It will involve a closer analysis 
of the support of the limit distribution of the sequence y/n(r\n — ?7o). 

To verify p9)) with a general compact set K C £°°{X, M) we simply note that in 
view of the compactness it suffices to verify that for every sequence /i„ such that 
\\hn - hoWoc for some ho € Ho, 

sup ^/nP {lna+n-^/^hr,<t+n-^/^Sr, ~ lj;o+n-i/2,j^<i) — > 0. 
teR'',a<t<6 

Thus we can apply the preceding lemma with the variables (X, Yj) equal to 
{^o[X)M{X),h^{,X)) and i = 1/^. 

Example 2, continued. Suppose that rjn is the empirical distribution of a ran- 
dom sample from the cumulative distribution function r]o on W^. Then the limit 
distribution of the sequence y/n{r]„ — rjo) is the d-dimensional r/o-Brownian sheet 
on R'^. 

If d = 1, then the Brownian sheet is a Brownian bridge and can be represented 
as B orjo for B a standard Brownian bridge on the unit interval. A typical function 
in the support of the limit distribution of the sequence \/n{r]n — rjo) can be repre- 
sented as ho = ho rjo for some function h : [0, 1] R. The conditional law of the 
variable ho{X) given rjoiX) = z is the Dirac measure at h{z). Because the standard 
Brownian bridge is continuous, the function h can be taken continuous and hence 
the corresponding Markov kernels K{z, •) = ^?i(z)(') are weakly continuous in z, as 
required by the preceding lemma. 

If d > 1, then we can, without loss of generality, suppose that 770 is a distribution 
function on [0, 1]'' with uniform marginal distributions (i.e. a copula function). Then 
the conditioning event rjo{X) ~ z will typically restrict X to a one-dimensional 
curve in [0, 1]''. Under sufficient smoothness of rjo, this curve will vary continuously 
with z, and under smoothness conditions on the law of X, the conditional distribu- 
tion of ho{X) given rjo^X) = z for a continuous function ho will vary continuously 
as well. Ghoudi and Remillard Q give sufficient conditions for this continuity in a 
number of examples. 

The preceding lemma can be extended to the case of multidimensional variables. 
For simplicity we only consider the two-dimensional case. 

Lemma 3.3. Suppose that X, Y, Yt (with t > 0) are random variables in defined 
on a common probability space such that 
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(i) X possesses a Lebesgue density f that has continuous conditional densities; 

(ii) \\Yt - y|ioo ^ and ||y|loo < oo; 

(Hi) the conditional distribution ofY given X ~ s can be represented by a Markov 
kernel K{s, ■) such that the map s i— > K(s, •) is continuous at x for the weak 
topology. 

Then for every continuous function 5 : M ^ M and every converging sequences 
Xt ^ X, at ^ a and 6t — > 6 > 0, as i — > 0, 

-^9iYt){lx+tatYt<xt+tbt - ix+tatYt<Xt) 

X2 



bi / 9iy)K{{xi,S2),dy) f{xi,S2)ds2 

J —00 J 

+ b2 / g{y) K{{si,X2),dy) f{si,X2)dsi. 



Proof. The event {X + tatYt < xt + tbt} n {X + tatYt < XtY can be decomposed 
in the three events 

/ = {xit < Xi+ tatYit < xit + tbit,X2 + tatY2t < X2t}, 
II = {xit < Xi+ tatYit < xit + tbit,X2t < X2+ tatY2t < X2t + i&2t}, 
/// = {Xi + tatYit < xit,X2t < X2+ tatY2t < X2t + tb2t}. 

In view of the boundedness of the Yt the event // is contained in an event of the 
form {X e Bi} for Bt rectangles of area 0{t^). Therefore, this event does not 
contribute to the hmit. 

The contribution of the event / with Yt = Y can be written 



ff(2/)l«i<atyi<«i+6it '^S2+taty2t<X2t K{{xit - Uit, S2), dy) 

X f{xit — Ult, S2) dui ds2. 
By arguments as given previously this can be shown to converge to 

.9(2/)l«i<ai;i<«i+6i K{{xi, S2),dy)ls2<x2 S2) dui ds2. 

The integral with respect to ui can be computed explicitly and the expression 
reduces to the first term on the right of the lemma. 

The contribution of the event /// gives the second term. 

We can replace Yt by Y by similar arguments as in the one-dimensional case. (In 
fact, bound X2t by oo and use exactly the same arguments.) □ 



4. Pseudo observations 

In this section we consider the asymptotic behaviour of the process {^Jni^^nO °r]n — 
POorjo) : 9 G Q} for a given class Q of functions : IR'' ^ R. The set-up is the same 
as in Section [31 As explained in the introduction we can decompose this process as 

G^niO o rin - 9 o rjo) + Gn9 o rjo + s/nP{9 o rjn - 9 o 770). 

Under the conditions of Theorem 13. II or Theorem 13. 2i Theorem 12. 3i or their exten- 
sions, the first term will converge to zero in probability in ^°°(0). The second term 
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will converge in distribution to a Gaussian process in this space if and only if the 
class of functions O is Donsker for the law Potiq^. If the third term also converges 
in distribution, then the sum of the three processes is asymptotically tight, and it 
will usually be straightforward to deduce its limit distribution from consideration 
of the marginal distributions. 

The behaviour of the third term will follow by the (functional) delta-method if 
the sequence \/n(r]n — Vo) converges in distribution in the Banach space H and the 
map 7] 1-^ {P9 o T] : 9 G Q) from H to £°°{Q) is suitably differentiable. If the limit 
distribution of the sequence \/n{rin — fjo) concentrates on the space Ha C H, then it 
suffices that the map r] i-^ (Pdor] : 6* G 8) be "Hadamard differentiable tangentially 
to iJo" , i.e. for every converging sequence ht ho £ Ho C H 



t 



uniformly in 9 G Q, for a continuous linear map L : liniJo £°°{ld). Under the 
additional condition that L is defined on all H, this implies 

V^P{9 orin-9o rio) = L{V^{f]n - iio)){9) + op(l). 

Cf. van der Vaart and Wellner page 374. 

As in the preceding section we consider the cases that the functions 9 are smooth 
or of bounded variation separately. In the former case the differentiability is relative 
to a weak norm on H (and is easy to prove), but for discontinuous functions 9, such 
as the indicator functions l(_oo,6»]j the differentiability requires a strong norm on 
H and some conditions on the underlying distribution. 



4-1. Smooth functions 

If the functions 9 are differentiable with bounded derivatives, then the Hadamard 
differentiability is true for H equipped with the Li(P)-norm on H. 

Lemma 4.1. Let the functions 9 : M.^ M. in be continuously differentiable with 
derivative 9 such that ||^(a;)|| < 1 for every x G K''. Then the map rj i-^ {P9 o rj : 
9 € Q) from Li{X,A, P) to is Hadamard differentiable at rjo with derivative 

h^ {P9oh:9 ee). 

Proof. Given a sequence ht with P\ht — /io| ^ we can write, by Fubini's theorem, 
^P{9{7^ + tht)-9{7])) -P9{r])h 

= [ P{9{r] + stht)ht-9(r])h)ds 
Jo 

< f P\\9i7j + stht)^Hn)\\\\ht\\ds + P\\9{r])\\\\ht-ho\\. 
Jo 

The second term on the right is bounded above by P\ht — ho\ and converges to 
zero by assumption. The first term on the right converges to zero by the dominated 
convergence theorem. □ 



4-2. Functions of bounded variation 



In the second result we let be a set of functions of bounded variation on a 
bounded interval in M, and consider the Hadamard differentiability of the map 
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rj I—!- {Pd o ?7 : 6' e 0) as a map from £°"{X) to £°°(8). For simplicity of notation, 
let X be a random variable with law P. 

Lemma 4.2. Let the functions 9 €z Q be distribution functions of subprobability 
measures supported on a compact interval / C K. Suppose that the variable rio{X) 
possesses a Lebesgue density f that is continuous on a neighbourhood of I . Then 
the map rj i-^ {P9 o r] : 9 G Q) from £°°{X) to £°°{Q) is Hadamard differentiable at 
rjo tangentially to the set of all Hq such that there exists a version of the conditional 
distribution of ho{X) given rio{X) = s that is weakly continuous in s G /. The 
derivative is given by Hq i— > (^J Fi(^ho{X)\r]o{X) = s) f{s)d9{s) : 9 G Q^. 

Proof. Let ho be as given and suppose ht ho in £°°{X). 

For given s G R and u > let Xs,u be the continuous function that takes the 
value on (— oo,s — u], takes the value 1 on [s,oo) and is linear on the interval 
[s — u,s]. Then l[s.oo) < Xs,u, and hence 

P{is<no+tht - ^s<vo) < P{Xs,u{Vo+tht) ~ XsAvq)) +PiXs,u{Vo) " l[s,oo)(?/o))- 

Because r]o{X) possesses a Lebesgue density that is bounded on a neighbourhood 
of / and l[s,oo) ~ Xs,u vanishes off the set (s — u, s), the second term on the right is 
bounded in absolute value by a multiple of u, uniformly in s ranging through /, for 
small u. By choosing u = St this term divided by t can be made arbitrarily small 
by choice of S. 

Because Xs,u is absolutely continuous with derivative 1/u on (s — m, s) and 
elsewhere, the first term on the right divided by 1/t can be written in the form 



For u = 5t this converges to 'E{ho{X)\rjo{X) = s^f{s), by Lemma [X^ imiformly in 
s ranging over J. 

It follows that, uniformly in s ranging over /, 



A similar argument using the functions x.s-\-u,u 

instead of Xs,u gives a corresponding 
lower bound, whence the expression in brackets converges to zero, uniformly in s 
ranging through compacta. This concludes the proof of the lemma for Q equal to 
the set of functions l[s,oo) with s in a compact interval. 

For a general collection Q of functions of bounded variation we can write 



Next we use the assumption that the functions 9 Q are supported on the compact 



The applicability of the second lemma depends on whether the set Ho of func- 
tions such that the conditional distribution of ho{X) given rio{X) = s is weakly 
continuous in s is large enough to support the limit distribution of the sequence 
\/n{r]n — r/o). As noted in the preceding section, under some smoothness conditions 
on r]o and on the distribution of X, the set Ho typically contains all continuous 
functions. Then it suffices that the sequence possesses a continuous weak limiting 
process. 






interval / with total variation bounded by 1. 



□ 
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5. Examples: completion 

In this section we return to two of the three examples discussed at the end of 
Section 3, Example 2 and Example 3. We give the theorems (and proofs) resulting 
from our approach. The general theme here is that the traditional results given in 
Corollaries 15.11 and l5.3l for indicator functions involve non-trivial restrictions on the 
underlying distribution 770 of the data, while the results for indexing by Lipschitz 
functions given in Corollaries 15.21 and 15.41 involve almost no restrictions on 770 (but 
significantly smoother indexing functions 9). 

5.1. Two corollaries for Kendall processes 

For the Kendall process, Example 2, it suffices to consider the case in which ryo is 
concentrated on [0, 1]'' and has uniform marginal distributions (i.e. is a copula func- 
tion), as noted by Ghoudi and Remillard We first give a corollary for indexing 
by indicator functions, and then a corollary for indexing by Lipschitz functions. 

Corollary 5.1. Suppose that for a given interval [a, 6] C (0, 1).' 

(i) The variable rio{X) possesses a density k with respect to Lebesgue measure 

that is continuous on a neighbourhood of [a,b]. 
(a) The conditional distribution of X given r]o{X) = s, has a regular version 

representable as a Markov kernel K{s, •) such that s 1— *■ K{s, ■) is continuous 

on [a, b] for the weak topology. 

Then the sequence of processes y/n[Kn ~ K) as in ill]) tends in [a, b] in distri- 
bution to the process (G^^/e : 9 £ [a, &]) for Gjjo 0.1^ rjo-Brownian bridge process and 
fg : [0, 1]'' R defined as 

fe{x) = 1^„(,)<, - ki9) E[l,<xMX) = 9]. 

Corollary 5.2. (Kendall processes, Example 2, indexed by Lipschitz functions). 
Suppose that Q is a suitably measurable collection of continuously differentiate 
functions 9 : [0,1] —t [—1,1] with derivatives 9 satisfying \\9{x)\\ < 1 for every 
X £ [0,1]. Then the sequence of processes "■""'^^^ X]"=i ^ P^ivo)) tends 

in distribution in £°°{Q) to the process (G,igfg : 9 £ &) for G,,o an rjo-Brownian 
bridge process in £°°{Q) and fg : [0, 1]*^ — *■ M defined as 

fe{x) = 9{rjo{x)) - P9{U<x). 

Proof of Corollary ] 5. 1\ We apply the decomposition ^ with fe.ri{x) — l{ri{x) < 
9}, for distribution fimctions 77 on [0, 1]'^, 9 £ [0, 1] and x £ [0, 1]''. 

As discussed following the proof of Lemma 3.2, hypotheses (i) and (ii) imply that 
the condition (fT9| for Theorem 3.2 (with d = 1) holds by way of Lemma 3.2, and 
hence the first term on the right side of ([2|) converges in probability to uniformly 
in 9 £ [a, b]. 

The second term is simply the usual empirical process for the i.i.d. one-dimen- 
sional random variables 770(^1)5 • ■ ■ j Voi^n), and hence it converges weakly as claim- 
ed by standard theory. 

To handle the third term, note that (i) and (ii) imply that the hypotheses of 
Lemma 4.2 hold, and hence that the map 77 {Pfg,rj ■ 9 £ Q} from £°°{X) 
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to £°°{[a,b]) is Hadamard differentiable tangentially to C([0, l]'^) with derivative 
L : C([0, 1]'^) ^ ^°°([a, b]) given by 

L{ho){9) = ~E{h„{X)\MX) = 9)k{d). 

Weak convergence of the third term then foUows from van der Vaart and Wellner 
[13], Theorem 3.9.5, page 375. 

The joint limit law of the second and third term can be determined from the 
marginals, and the limit of the sum of the two terms can be represented in the 
form as given. An insightful way to derive this is from asymptotic linearity of the 
two terms as follows. The second term is already linear with influence functions 
X I— > l^Q(j.)<g. The third term can be approximated by L{{^/n(rjn — ^y)) , where ry„ — 

V = n'^ T,7=i('^x,<x - vix)), so that L{{^{T]n - v)) = n^^^^ T,i=i L^x,,!] - v)- 
The terms in the latter sum should be understood as L acting on the functions 
X '—^ ^[Xi,i]{x) ~ vi^) for fixed Xi. We thus obtain that 

n 

HiMvn - h)) = n-^^''J2HMx..i]) - V^Hv) = G„oi(l[x,4])- 

i=l 

The representation of the limit process as given in the corollary follows. □ 

For many distribution functions ryo the corresponding density k oi K is un- 
bounded at and hence not continuous on [0, 1]. See Barbe, et al. Q, pages 202- 
208, for a number of explicit examples. In particular this is true even when 770 is 
the uniform distribution on [0, 1]''. For such distributions the preceding corollary 
does not yield convergence of Kendall's process in the space ^°°([0, 11). However, 
this convergence may be valid even when k is unbounded. Barbe et al. [2] show that 
under the growth condition 

kit) ^ o{t-'/\logil/t))-'/'-'), tlO, e>0. 

convergence in the full domain still holds. They achieve this using results of Alexan- 
der [ij to show that the empirical process yjnirjn — 77o)l{''7o > a„} converges in the 
weighted metric || • /g(r;o)||oo if q{t) = t^/'^(log{l/t))P for some 1/2 < p < r/2 
and Qn = n^^{lognY ■ This strengthening of the convergence of y/n{rin — ijo) then 
compensates for the growth of fc at 0. 

Proof of Corollarv \5.Sl This follows by combining Theorems 3.1 and Lemma 4.1 
with the fact that !F = {6 o t/q : 9 & Q} is Donsker. □ 

5.2. Two corollaries for copula processes 

For the copula processes in Example 3 it again suffices to consider the case 
in which 770 — C, so that all all the marginal distributions ?7oj , j — 1, . . • , d, are 
Uniform(0, 1). The first of the following two corollaries was obtained in Stute 0] 
and Ghoudi and Remillard Q. 

Corollary 5.3. Suppose that: 

(i) Vo = C is continuous. 

(ii) The copula function rjo — C is continously differentiable on [0,1]'' with gra- 
dient VC(it). 
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Then the sequence of copula processes \pn(Cn — C) given in converges in dis- 
tribution in ^"^([0, l]*^) to the process {G,jg fu : u G [0, 1]'') for Gjjg an rj^-Brownian 
bridge process, and fu ■ [0, 1]*^ ^ R defined as 

fu{x) = lx<u - VC(li)'(l2;i<„i, . . . , lxa<ua)- 

Corollary 5.4. (Copula processes, Example 3, indexed by Lipschitz functions). 
Suppose that Q is a suitably measurable collection of continuously differentiate 
functions 9 : [0, 1]'* M such that with derivative \\9{x)\\ < 1 for every x G [0, l]"* 
and satisfying J(l, Q, L2) < 00. Then the sequence of processes n~^/^ (^('7n(^i))~ 
P6{rio)) tends in distribution in to the process (Grio fe-.OeQ) for (U^Q an 

rjo-Brownian bridge process in i°"{Q) and fg : [0, l]'^ M defined as 

fg{x) = e{x) - pe{i,,<x, , . . . , lx.<xj . 

Proof of Corollary \5.'A We apply the decomposition ^ with 

fe,7^{x) = l{77i(a;i) < 6*1, ... , rid{xd) < Od} 

for 9 — {9i,...,9d) e [0,1]'*, X = (xi, . . . ,Xd) G [0, l]"*, and rjj the jth one- 
dimensional marginal distribution function on [0, 1] of the distribution function 
r] {so r]j{uj) =T]{l,...,l,Uj,l,...,l)). 

To show that the first term in ([2|) converges to zero uniformly in 9 & [0, 1]'', we 
can apply Theorem 2.1. The class of functions 

fg^^ix) = l{.Ti < ■■■,Xd< Vd\<)d)} 

is a class of indicators of a Vapnik-Chervonenkis-class of sets. Thus Theorem 2.1 
applies if we show that ([3]) holds. But this is easily verified by the assumed continuity 
of r]o — C and the uniform consistency of the empirical quantile functions ?7~J for 
j = 1, ... ,d. Thus (P) holds. 

The second term in ^ is simply the classical empirical process of the random 
vectors Xi, . . . , Xn in [0, l]**, and converges weakly by classical theory. 

Finally the third term in ^ converges weakly to S/Ciu)' ■ w)), for 

v{x,u) = (la;i<«i, ■ • ■ , la;d<«d), by the delta-method for the map r] Pfu,n- This 
map can be decomposed as 

ri^ {Vi'{ui),...,r^^\ud)) ^ (Co(ry-i(wi),...,7y^iM), [0,1]'* 

and can be shown to Hadamard-differentiable from the domain of distribution func- 
tions in = £°°{[0, 1]'*) to £°°{e) = £°°{[0, 1]'*) by the chain rule, using the 
continuity of VC and the fact that the quantile transformation is Hadamard dif- 
ferentiable. □ 

It is possible to extend Corollary 5.3 to the case in which VC is continuous on 
(0, l)** but satisfies certain growth restrictions at and/or 1. Then weighted metrics 
are involved in the proof. 

Proof of Corollary \5.4\ This follows by combining Theorems 3.1 and Lemma 4.1 
with the fact that = {6 : 6 E Q} is Donsker and the delta-method, e.g. van der 
Vaart and Wellner fl^], Theorem 3.9.5, page 375. □ 
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